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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/973, 303 



DATE: 05/15/98 
TIME: 13:13:42 



INPUT SET: S2584Lraw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



7*. 



1 SEQUENCE LISTING 
2 

3 (1) General Information: 
4 

5 (i) APPLICANT: Peter DORMER 
6 

7 (ii) TITLE OF INVENTION: PROTEIN WITH DIFFERENTIATION- INDUCING 

8 ACTIVITY FOR FRIEND'S ERYTHROLEUKEMIA CELL LINES 
9 

10 (iii) NUMBER OF SEQUENCES: 10 
11 

12 (iv) CORRESPONDENCE ADDRESS: 

13 (A) ADDRESSEE: LOWE, PRICE, LEBLANC & BECKER 

14 (B) STREET: 99 Canal Center Plaza, Suite 300 

15 <C) CITY: Alexandria 

16 <D) STATE: VA 

17 { E ) COUNTRY: USA 

18 <F) ZIP: 22314 
19 

20 (v) COMPUTER READABLE FORM: 

21 (A) MEDIUM TYPE: Floppy disk 

22 (B) COMPUTER: IBM PC compatible 

23 (C) OPERATING SYSTEM: PC- DOS/MS -DOS 

24 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 
25 

26 (vi) CURRENT APPLICATION DATA: 

27 (A) APPLICATION NUMBER: 

28 (B) FILING DATE: 

2 9 <C) CLASSIFICATION: 
30 

31 (viii) ATTORNEY/ AGENT INFORMATION: 

32 (A) NAME: Presta , Frank P. 

3 3 <B) REGISTRATION NUMBER: 19,82 8 

34 <C) REFERENCE/ DOCKET NUMBER: 3428-005 
35 

3 6 (ix) TELECOMMUNICATION INFORMATION: 

37 { A ) TELEPHONE: (703) 684-1111 

38 <B) TELEFAX: (703) 684-1124 
39 

40 
41 

42 (2) INFORMATION FOR SEQ ID NO: 1: 
43 

44 (i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 1495 base pairs 

46 (B) TYPE: nucleic acid 



ENTERED 



PAGE: 2 RAW SEQUENCE LISTING DATE: 05/15/98 

PATENT APPLICATION US/08/973,303 TIME: 13:13:42 

INPUT SET: S2584I.raw 

47 (C) STRANDEDNESS: single 

48 ( D ) TOPOLOGY: linear 
49 

50 (ii) MOLECULE TYPE: cDNA to mRNA 

51 

52 (iii) HYPOTHETICAL: YES 

53 

54 (iv) ANTI-SENSE : NO 

55 

5 6 (vi) ORIGINAL SOURCE: 

5 7 (A) ORGANISM: Mus musculus 
58 

59 
60 

61 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

62 

6 3 CCGACCGTGC GGACTTAAGA TGGAGGCACT TCCTGTCTGC GGCGGGAAGA GAAGGCTCGG 60 
64 

6 5 TCGGAGCCGG GAATGCTGGG ACTTGTAGTG CGTAGTCAAT GGTTCTCTAT GGGCTTTCAG 120 
66 

6 7 AGTGAGTGGC GGGAAGGCGG CCCCGAGGCA TGCTGGGAGT TGTAGTCCTG CCGTCGTCAA 180 
68 

6 9 TGGTTCTCTA TGGGCTTTCA GAGTGAGTGG CGGGAAGGCG GCCCCGAGGC ATGCTGGGAG 24 0 
70 

71 TTGTAGTCCT GCCATAGTCA ATGGTTCTCT ATGGGCTTTC AGACTGAGTG GCGGGAAGGC 300 
72 

7 3 GGCCCCGAGG CATGCTGGGA GTTGCAGCGC CATGTTTTAA AGCACGCGTT TCTCTGTATA 360 
74 

7 5 GACCTGGCTG TGGATTTTTC GCTAATTCTT TTTTTTAGCT TTATTTTTAA TTTTTACTTT 420 
76 

7 7 TTCACACAGG ATTTCTCTTT ATAGCCTTGG CTACCGTTTT TTCCCTAATT ATTCTCCTTT 4 80 

78 

7 9 TCATTTTGGT TTATTTTTTT TTAATTTTGG TTTTTTTAAG ACAGGGTTTC TCTGTATAGA 54 0 
80 

81 CCTGGCTGTG GATTTCTCAC TAATTATTTT TTTTAGCTTT ATTTTTAATT TTTACTTTTT 600 
82 

8 3 CACACAGGAT TTCTCTTTAT AGCCTTGGCT ACCGTTTTTT CCGTAATTAT TCTTATTTTC 66 0 
84 

85 ATTTTGGTTT ATTTTTTAAT TTTAATTTTT GATTTTGGAG ACAGGGTTTC TCTTTTAGCC 720 
86 

87 GCAGCTATGG TTTCTGCCCT AATTATTCTT GTCCTTATTT GTAATTTAAT TCTTAATTTA 7 80 

88 

8 9 ATTTAATTTA TAATTTTGTT GTAAGTTTTT CTGTGGGCGT GAATGGAAAG TCTAACCCGT 84 0 
90 

91 GTTTCTCTGT TCAGCGTCCG CCGGTCACGG CCGCCGCCCC CAGCGACGTC ACCCACACGC 900 
92 

9 3 GCAGAAGCGG ACGCCGCGGT CAAGATGTCT CTGCCATGCC CACGGGACGC ACGGACGCAC 96 0 
94 

9 5 GGACGGACGG ACGGACTCCA CAAGGTAGGA AGCCTGCGCC GACCGCACCG CCGCACCCAC 1020 
96 

97 CACAGCACAC AGGACACACG CGGGCCCCGC GCCCGCCCAG GCACACGCGG CACACACGGC 1080 
98 

9 9 ACACACGGCA GGCAGGCCAG GCACACGCAT CCGCAGGACC CGCCGCACCC GCCACGCAGA 1140 



PAGE: 3 



100 
101 
102 
103 
104 
105 
106 
107 
108 
109 
110 
111 
112 
113 
114 
115 
116 
117 
118 
119 
120 
121 
122 
123 
124 
125 
126 
127 
128 
129 
130 
131 
132 
133 
134 
135 
136 
137 
138 
139 
140 
141 
142 
143 
144 
145 
146 
147 
148 
149 
150 
151 
152 



RAW SEQUENCE LISTING DATE: 05/15/98 

PATENT APPLICATION US/08/973,303 TIME: 13:13:44 

INPUT SET: S25841,raw 

CACGGACGAG CCGCCGCGGT CAAGATGTTC ACCCGCCGCG GTCAAGATGT ATGTGCCACC 1200 

GACCCTCGCC CCGCTGGACG GACGGACGGA CGCACGCACG CCGTCAGCGT CCACCGGTCA 12 60 

CTGCCGCCGC CCACAGTGAT GTCACCCACG AAAGCACACA CGTAGAAGCG GACGCCGTGG 13 20 

TCAAGATGTC TCTGCCATCC CCACAGGACG GACGGACGGA CTCCACAAGG TGCGCGTGTC 13 80 

GCCGAGGCCG CCAGGACGGA GCGATTCTCA CGGAGGAAGG AGCACGCCAA CAGGGCCTGA 14 4 0 

CTGCGTACAG ACATGTCCCC CTCAATAAAA TTGCAGTTGA AATGGAAAAA AAAAA 14 95 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: YES 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 155. .688 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CGCGCCCGCC CGGGATCCCC AGCTGCCGCC GCGCCCGCCC GCCCGCCCGG GGCCCCCGCT 60 

GCAGAACCGT GACCGTCCGC CGGTCACGGC CGCCGCCCCC AGCGACGTCA CCCACACGCG 120 

CAGAAGCGGA CGCCGCGGTC AAGATGTCTC TGCC ATG CCC ACG GGA CGC ACG 172 

Met Pro Thr Gly Arg Thr 
1 5 

GAC GCA CGG ACG GAC GGA CTG ACT CCA CAA GGT AGG AAG CCT GCG CCG 220 
Asp Ala Arg Thr Asp Gly Leu Thr Pro Gin Gly Arg Lys Pro Ala Pro 
10 15 20 

ACC GCA CCG CCG CAC CCA CCA CAG CAC ACA GGA CAC ACG CGG GCC CCG 268 
Thr Ala Pro Pro His Pro Pro Gin His Thr Gly His Thr Arg Ala Pro 
25 30 35 

CGC CCG CCC AGG CAC ACG CGG CAC ACA CGG CAC ACA CGG CAG GCA GGC 316 
Arg Pro Pro Arg His Thr Arg His Thr Arg His Thr Arg Gin Ala Gly 
40 45 50 
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RAW SEQUENCE LISTING DATE: 05/ 1 5/98 

PATENT APPLICATION US/08/973,303 TIME: 13:13:45 

INPUT SET: S25841.raw 

CAG GCA CAC GCA TCC GCA GGA CCC GCC GCA CCC GCC ACG CAG ACA CGG 3 64 

Gin Ala His Ala Ser Ala Gly Pro Ala Ala Pro Ala Thr Gin Thr Arg 
55 60 65 70 

ACG AGC CGC CGC GGT CAA GAT GTT CAC CCG CCG CGG TCA AGA TGT ATG 412 
Thr Ser Arg Arg Gly Gin Asp Val His Pro Pro Arg Ser Arg Cys Met 
75 80 85 

TGC CAC CGA CCC TCG CCC CGC TGG ACG GAC GGA CGG ACG CGC GCA CGC 460 
Cys His Arg Pro Ser Pro Arg Trp Thr Asp Gly Arg Thr Arg Ala Arg 
90 95 100 

CGT CAG CGT CCA CCG GTC ACT GCC GCC GCC CAC AGT GAC GTC ACC CAC 5 08 

Arg Gin Arg Pro Pro Val Thr Ala Ala Ala His Ser Asp Val Thr His 
105 110 115 

GAA AGC ACA CAC GTA GAA GCG GAC GCC GTG GTC AAG ATG TCT CTG CCA 556 
Glu Ser Thr His Val Glu Ala Asp Ala Val Val Lys Met Ser Leu Pro 
120 125 130 

TCC CCA CAG GAC GGA CGG ACG GAC TCC ACA AGG TGC GCG TGT CGC CGA 6 04 

Ser Pro Gin Asp Gly Arg Thr Asp Ser Thr Arg Cys Ala Cys Arg Arg 
135 140 145 150 

GGC CGC CAG GAT GGA GCG ATT CTC ACG GAG GAA GGA GCA CGC CAA CAG 652 
Gly Arg Gin Asp Gly Ala lie Leu Thr Glu Glu Gly Ala Arg Gin Gin 
155 160 165 

GGC CTG ACT GCG TAC AGA AAT GCC CCC CCT CAA TAA AATTGCAGTT 6 98 

Gly Leu Thr Ala Tyr Arg Asn Ala Pro Pro Gin * 
170 175 

GAAATGGAAA AAAAAAA 715 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Met Pro Thr Gly Arg Thr Asp Ala 
1 5 

Gly Arg Lys Pro Ala Pro Thr Ala 
20 

Gly His Thr Arg Ala Pro Arg Pro 

35 40 



Arg Thr Asp Gly Leu Thr Pro Gin 

10 15 

Pro Pro His Pro Pro Gin His Thr 

25 30 

Pro Arg His Thr Arg His Thr Arg 
45 



PAGE: 5 RAW SEQUENCE LISTING DATE: 05/15/98 

PATENT APPLICATION US/08/973,303 TIME: 13:13:46 

INPUT SET: S25841.mw 

206 

207 His Thr Arg Gin Ala Gly Gin Ala His Ala Ser Ala Gly Pro Ala Ala 

208 50 55 60 
209 

210 Pro Ala Thr Gin Thr Arg Thr Ser Arg Arg Gly Gin Asp Val His Pro 

211 65 70 75 80 
212 

213 Pro Arg Ser Arg Cys Met Cys His Arg Pro Ser Pro Arg Trp Thr Asp 

214 85 90 95 
215 

216 Gly Arg Thr Arg Ala Arg Arg Gin Arg Pro Pro Val Thr Ala Ala Ala 

217 100 105 110 
218 

219 His Ser Asp Val Thr His Glu Ser Thr His Val Glu Ala Asp Ala Val 

220 115 120 125 
221 

222 Val Lys Met Ser Leu Pro Ser Pro Gin Asp Gly Arg Thr Asp Ser Thr 

223 130 135 140 
224 

225 Arg Cys Ala Cys Arg Arg Gly Arg Gin Asp Gly Ala lie Leu Thr Glu 

226 145 150 155 160 
227 

228 Glu Gly Ala Arg Gin Gin Gly Leu Thr Ala Tyr Arg Asn Ala Pro Pro 

229 165 170 175 
230 

231 Gin 

232 

233 

2 34 (2) INFORMATION FOR SEQ ID NO: 4: 
235 

2 36 (i) SEQUENCE CHARACTERISTICS: 

237 (A) LENGTH: 636 base pairs 

238 (B) TYPE: nucleic acid 

239 (C) STRANDEDNESS: single 

240 (D) TOPOLOGY: linear 
241 

242 (ii) MOLECULE TYPE: cDNA to mRNA 

243 

244 (iii) HYPOTHETICAL: YES 

245 

246 

24 7 (ix) FEATURE: 

24 8 (A) NAME /KEY : CDS 

249 (B) LOCATIONS . .636 

250 

251 

252 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

253 

254 ATG GGG CTG CAG AAC CGT GAC CGT CCG CCG GTC ACG GCC GCC GCC CCC 48 

255 Met Gly Leu Gin Asn Arg Asp Arg Pro Pro Val Thr Ala Ala Ala Pro 

256 180 185 190 
257 

258 AGC GAC GTC ACC CAC ACG CGC AG A AGC GGA CGC CGC GGT CAA GAT GTC 9 6 



PAGE: 1 SEQUENCE VERIFICATION REPORT DATE: 05/15/98 

PATENT APPLICATION US/08/973,303 TIME: 13:13:47 

INPUT SET: S25841.raw 



Line Error Original Text 



